3574 results found.
Written
Tokenizer,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
MIT License
Size:
None
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: A Large-Scale Corpus of E-mail Conversations with Standard and Two-Level Dialogue Act Annotations
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Motoki Taniguchi | spaCy sentencizer | /N |
Documentation:
https://spacy.io/api/sentencizer/
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Not Available
License:
Size:
360392 words
Production Status:
Newly created-finished
Use:
Discourse
- Paper title: A Large-Scale Corpus of E-mail Conversations with Standard and Two-Level Dialogue Act Annotations
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Motoki Taniguchi | Enron E-mail Conversational Corpus | /N |
Documentation:
This paper
Multimodal/Multimedia
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
Freely Available
License:
CreativeCommons
Size:
63566 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Supervised Visual Attention for Multimodal Neural Machine Translation
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tetsuro Nishihara | Flickr30k Entities JP | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
National Science Foundation, Xerox UAC and the Sloan Foundation
Size:
158915 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Supervised Visual Attention for Multimodal Neural Machine Translation
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tetsuro Nishihara | Flickr30k Entities | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German
Availability:
Freely Available
License:
CreativeCommons
Size:
31014 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Supervised Visual Attention for Multimodal Neural Machine Translation
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Tetsuro Nishihara | Multi30k | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
open-source
Size:
937K documents Other
Production Status:
Existing-used
Use:
Natural Language Generation
- Paper title: Retrieval-Augmented Controllable Review Generation
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jihyuk Kim | Amazon Book Reviews | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
100 MByte
Production Status:
Existing-used
Use:
Emotion Recognition/Generation
- Paper title: Exploiting Narrative Context and A Priori Knowledge of Categories in Textual Emotion Classification
- Paper track: Short paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yoshihiko Hayashi | Story Commonsense dataset | /N |
Documentation:
Yes. In English.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
Size:
500 entries
Production Status:
Existing-used
Use:
Summarisation
- Paper title: Controllable Abstractive Sentence Summarization with Guiding Entities
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Changmeng Zheng | DUC-2004 Task 1 data | /N |
Documentation:
https://www-nlpir.nist.gov/projects/duc/data/2004_data.html
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
4M entries
Production Status:
Existing-used
Use:
Summarisation
- Paper title: Controllable Abstractive Sentence Summarization with Guiding Entities
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Changmeng Zheng | Annotated English Gigaword | /N |
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC2012T21/
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Named Entity Recognition
- Paper title: Learning Efficient Task-Specific Meta-Embeddings with Word Prisms
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | KC Tsiolis | CoNLL 2003 NER shared task corpus | /N |
Documentation:
None




